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Abstract. In this paper we investigate the incorporation of information about 
the users’ personality into a number of collaborative filtering methods, aiming 
to address situations of user preference scarcity. Through empirical experiments 
on a multi-domain dataset obtained from Facebook, we show that the proposed 
personality-aware collaborative filtering methods effectively -and consistently 
in the studied domains- increase recommendation performance, in terms of 
both precision and recall. We also present an analysis of relationships existing 
between user preferences and personality for the different domains, considering 
the users’ gender and age. 
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1 Introduction 

Collaborative filtering has been shown to be one of the most successful approaches 
for providing personalized recommendations. It has, however, limitations when there 
is a scarcity of user preferences, which leads to a restricted coverage of the user-item 
preference space (sparsity problem) and a weak capability of recommending items to 
a new user (cold start problem). 

Aiming to address this situation, approaches have been proposed to exploit 
additional information about users and items, increasing the density of the user-item 
preference space or capturing user similarities that do not (merely) depend on personal 
preferences. Among the existing input signals that have been exploited to establish 
those similarities, psychological aspects -such as emotions, moods and personality- 
have been gaining increasing attention by the recommender systems community [12]. 

Personality, as defined in psychology, is an organized and dynamic set of 
characteristics possessed by a person that uniquely influence his or her cognitions, 
motivations, emotions and behaviors in certain situations. In particular, recent studies 
have shown that personality -accounting for user response patterns- correlates with 
user preferences in different domains, such as music [14, 16], movies and TV shows 
[3, 13], books and magazines [2, 15], and websites [10]. These correlations are the 
basis with which traits and types of personality may help explain why people prefer 
one option to other, and could be used to improve personalization services. 
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In fact, several authors [9, 11, 17,18] have already explored how user personality can 
be exploited to improve collaborative filtering recommendations. However, due to the 
current difficulty of obtaining information about both user preferences and personality, 
proposed approaches have been evaluated with relatively small datasets in single 
domains. Addressing these limitations, in this paper we present an empirical study 
comparing state of art and novel personality-aware collaborative filtering methods with 
a large multi-domain dataset obtained from Facebook. We show that the proposed 
methods effectively -and consistently in the studied domains- increase recommendation 
performance, in terms of both precision and recall. Moreover, differently to previous 
work, we present an analysis of relationships between user preferences and personality 
in several domains, considering the users’ gender and age. We show that these 
demographic attributes lead to different user preference-personality patterns, which calls 
for further work in the personality-aware recommendation research agenda. 

The reminder of the paper is organized as follows. Section 2 reviews related work. 
Section 3 presents the evaluated personality-aware collaborative filtering methods, and 
Section 4 reports and discusses obtained experimental results. Complementing these 
results, Section 5 analyzes relationships between user preferences and personality in the 
studied domains. Finally, Section 6 ends with conclusions and future research lines. 

2 Related Work 

2.1 Personality Modeling 

Different models have been proposed to characterize and represent human personality. 
Among them, the Five Factor (FF) model [4] is considered one of the most 
comprehensive, and has been the mostly used to build user personality profiles [9], The 
FF model establishes five broad domains or dimensions -called factors and 
commonly known as the Big Five- to describe human personality: openness, 
conscientiousness, extraversion, agreeableness and neuroticism. 

The openness (OPE) factor reflects a person’s tendency to intellectual curiosity, 
creativity and preference for novelty and variety of experiences. The conscientiousness 
(COS) factor reflects a person’s tendency to show self-discipline and aim for personal 
achievements, and have an organized (not spontaneous) and dependable behavior. The 
extraversion (EXT) factor reflects a person’s tendency to seek stimulation in the 
company of others, and put energy in finding positive emotions. The agreeableness 
(AGR) factor reflects a person’s tendency to be kind, concerned, truthful and 
cooperative towards others. Finally, the neuroticism (NEU) factor reflects a person’s 
tendency to experience unpleasant emotions, and refers to the degree of emotional 
stability and impulse control. 

The measurement of the FFs comprises items that are self-descriptive sentences or 
adjectives, commonly presented in the form of short tests. In this context, the 
International Personality Item Pool 1 (IPIP) is a publicly available collection of items 
for use in psychometric tests, and the 20-100 item IPIP proxy for Costa and McCrae’s 


1 International Personality Item Pool (IPIP), http: / / ip.i p. ori . org 
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NEO-PI-R test [5] is one of the most popular and widely accepted questionnaires to 
measure the Big Five in adult men and women without overt psychopathology. 

In this paper we use the FF model to represent user personality. In particular, we 
utilize the dataset provided by myPersonality project 2 [1], which has personality and 
entertainment preference profiles of Facebook users. The dataset contains the FF 
scores of 3.1 million users, collected by using 20 to 336 item IPIP questionnaires. 
With this dataset we aim to exploit the relationships between user preferences and 
personality for recommendation. 

2.2 Relationships between User Preferences and Personality 

Personality influences how people make their decisions [12]. In particular, it has been 
shown that people with similar personality characteristics are likely to have similar 
preferences. In [16] Rentfrow and Gosling investigated how music preferences are 
related with personality in terms of the FF model, showing that “reflective” people 
with high openness usually have preferences for jazz, blues and classical music, and 
“energetic” people with high degree of extraversion and agreeableness usually 
appreciate rap, hip-hop, funk and electronic music. Also in the music domain and using 
the FF model, Rawlings and Ciancarelli [14] observed that openness and extraversion 
are the personality factors that best explain the variance in personal preferences. They 
showed that people with high openness tend to like diverse music styles, and people 
with high extraversion are likely to have preferences for popular music. 

In the movie domain, Chausson [3] presented a study showing that people open to 
experiences are likely to prefer comedy and fantasy movies, conscientious individuals 
are more inclined to enjoy action movies, and neurotic people tend to like romantic 
movies. Afterwards, Odi et al. [13] explored the relationships between personality 
factors and induced emotions in movies for different social contexts -e.g. watching a 
movie alone or with someone else-, and observed different patterns in experienced 
emotions as functions of the extraversion, agreeableness and neuroticism factors. 

Extending the spectrum of analyzed domains, in [15] Rentfrow et al. investigated 
the relations between personality factors and user preferences in several entertainment 
domains, namely movies, TV shows, books, magazines and music. They focused their 
study on five personality-based content categories: aesthetic, cerebral, communal, 
dark and thrilling. The authors observed positive and negative relationships between 
such categories and some of the personality factors, e.g., they showed that aesthetic 
contents relate positively with agreeableness and negatively with neuroticism, and 
that cerebral contents correlate with extraversion. Also considering several domains - 
movies, TV shows, books and music-, in [2] we presented a preliminary study on the 
relations between personality types and entertainment preferences. Analyzing a large 
dataset of personality factor and domain genre preference user profiles, we extracted 
personality-based user stereotypes for each genre, and inferred association rules and 
similarities between types of personality of people with preferences for particular 
genres. Finally, in the multi-domain scenario of the Web, Kosinski et al. [10] 


myPersonality project, http: / /mypersonality. org 
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presented a study revealing meaningful psychologically relationships between user 
preferences and personality for certain websites and website categories. 

Continuing our previous work [2], in this paper we analyze the relationships 
between user preferences and personality in multiple entertainment domains. Here we 
extend the number of considered domain genres and, differently from other studies, 
conduct the analysis by clustering the user profiles according to demographic 
attributes, namely gender and age. We show that there are differences in the 
correlations between user preferences and personality depending on such attributes. 


2.3 Personality-Aware Recommender Systems 

Once extracted, personality factors could be used to build personality-based user 
profiles that may be exploited by personalized information retrieval and filtering 
approaches [8, 11], In [18] Tkalcic et al. applied and evaluated three user similarity 
metrics for the heuristic-based collaborative filtering strategy: a typical rating-based 
similarity, a similarity based on the Euclidean distance with FF data, and a similarity 
based on a weighted Euclidean distance with the FF data. Their reported results 
showed that approaches using FF data perform statistically equivalent or better than the 
rating-based approach, especially in cold-start situations. In her PhD dissertation [11], 
Nunes explored the use of personality user profile composed of NEO-IPIP items and 
facets in addition to the Big Five factors, showing that fine-grained personality user 
profiles help achieve better recommendations. Following the findings of Rentfrow and 
Gosling [16], in [9] Hu and Pu presented a collaborative filtering approach based on the 
correlations between personality types and music preferences, in which the similarity 
between two users is estimated by means of the Pearson’s correlation coefficient on the 
users’ FF scores. Combining this approach with a rating-based collaborative filtering, 
the authors showed significant improvements over the baseline of considering only 
ratings data. In [17] Roshchina proposed an approach that extracts FF-based profiles by 
analyzing hotel reviews written by users, and incorporates these profiles into a nearest 
neighbor algorithm to enhance personalized recommendations. Finally, in [6] Elahi et 
al. presented an active learning technique that exploits user personality to accurately 
acquire user ratings for collaborative filtering in cold-start situations. 

Instead of using the Euclidean distance as done in [18], in this paper we propose to 
use the cosine-based distance to establish (dis)similarities between users in the 
collaborative filtering strategy for both preference- and personality-based user 
profiles. Similarly to [11], we explore alternative representations of personality user 
profiles, but focusing on Big Five data instead of considering the large number of 
NEO-IPIP items and facets. For comparison purposes, we implement and evaluate the 
Pearson-based user similarity presented in [9], but we also assess a user similarity 
based on Spearman’s correlation, aiming to capture non-linear relationships between 
user personality factors. Finally, as done in [17], we evaluate collaborative filtering 
strategies based on nearest neighbors, but differently to that work, in this paper we 
report recommendation performance results for several neighborhood sizes, and 
evaluate hybrid approaches that combine user preference and personality data. 
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3 Personality-Aware Collaborative Filtering Methods 


In the recommender systems literature, the preference of a user u £ 'll for certain 
item i £ 3 is commonly represented as a rating r u l £ H , where Jl is a totally 
ordered set, e.g. non negative integers or real numbers within a certain range. For 
each user u, a recommender system aims to predict ratings r u i of items i unrated by 

u, and suggest the items with the highest predicted ratings r u l . 

In particular, heuristic-based collaborative filtering strategies compute r u l as an 
aggregate of the ratings of some other (usually, the most similar) users v. More 
formally, r u l = aggr UEWu r vi , where K u denotes the set of N users who are the most 
similar to u, and is usually referred as the set of neighbors or neighborhood. Several 
aggregation functions aggr have been proposed in the literature. One of the most 
widely used, and that we adopt in this paper (for unary/binary ratings) is: 

f ul = *u + K ^ sim(u,v ) • ( r v i - f£) 

where sim(u, v) is a function that measures the similarity between two users u and 

v, is the average value of user u’ratings, and k is a normalization factor, which is 
usually set as k = ^-/Yav€K u sim(u, v). 

Together with the rating aggregate function, the user similarity function is a central 
component that characterizes a collaborative filtering strategy. A standard similarity 
metric is based on the Pearson’s correlation coefficient: 


simQu.v) = 


T.i e o u m v (ru,i-ruXr v ,i-r v ) 

VZi6J u nJ v ( r u,i — r u)^ £ieJ u nJ„( r v,i — r v) 2 


where J u E 7 is the set of items rated by user u. This metric is not applicable when 
the ratings are either unary (such as the Facebook’s likes) or binary (such as the 
YouTube’s thumps up/down). In these cases, a user similarity based on the laccard’s 
coefficient is commonly used: 

\J U n 0 V \ 

= M (1) 

This is in fact the similarity metric we consider as baseline in our experiments. We 
shall refer the collaborative filtering strategy that utilizes sim pre f(u,v ), without 
considering the users’ personality, as CF. Analogously to [9], in order to incorporate 
personality information into the recommendation process, we shall study a hybrid 
recommendation strategy that linearly combines sim pre f(u,v) with a personality- 
based user similarity sim pers (u,v') as follows: 

sim(u, v) = A • sim pre f(u, v) + (1 - A) • sim pers (u, v) (2) 

where A £ [0,1] controls the influence of user preferences and personality on the 
recommendation process. For A values close to 1, user preferences are more relevant, 
while for A values close to 0, personality gets higher relevance. In the subsequent 
sections, we propose several formulations of sim pers (u,v') that yield the personality- 
aware collaborative filtering methods we empirically compare. 



130 


I. Femandez-Tobfas and I. Cantador 


Let p u = {Vu,ope’Pu,cos’Pu,ext’Vu,agr>Vu,neu) be the user u’s personality-based 
profile composed of u’s Big Five scores. Previous approaches have considered the 
components p uk ranging in a numeric interval, e.g. [1,5] and [1,100], as derived 
from the IPIP questionnaires. Here, we consider this representation as well, and refer 
it as continuous personality profile. Nonetheless, aiming to explore simpler 
personality-aware user profiles and similarities, we also propose to study a personality 
representation in which each component p uk takes one of a limited set of categorical 
values -namely low, medium, and high-, obtained from the corresponding original 
Big Five scores. We refer this representation as discrete personality profile. 

3.1 Cosine-Based Personality User Similarity Methods (COS) 

The first personality-based user similarity we evaluate is the cosine-based similarity: 



Its incorporation in formula (2) produces a recommendation method that we call 
COS. Depending on whether continuous or discrete personality profiles are used, we 
shall refer to COS-c and COS-d methods, respectively. These methods will have 
different implementations depending on the A value used, i.e., COS-c-A and COS-d- 
A , with 0 < A < 1. The case of A = 1 (exploiting only user preferences) is 
equivalent to the CF method, and the case of A = 0 will be referred as COS-c-pers 
and COS-d-pers, since only user personality information is exploited. 


3.2 Pearson-Based Personality User Similarity Methods (PEA) 

The second personality-based user similarity we evaluate was proposed by Hu and Pu 
in [9], and is based on the Pearson’s correlation coefficient: 

Y. k {Pu, k -K)(JPv, k -P~v) 


*,(«,?) = 


(4) 


^ k {Pu, k ~p-u) 2 Y, k {p v , k -%) 2 
Its integration in (2) produces a recommendation method that we call PEA. 
Similarly to the COS case, for the PEA similarity, we shall evaluate the PEA-c-per, 
PEA-d-per, PEA-c-A and PEA-d-A methods, with 0 < A < 1. 


3.3 Spearman-Based Personality User Similarity Methods (SPE) 


The third and last personality-based user similarity we propose is based on the 
Spearman’s correlation coefficient: 


simpers spe (M,v) = 


ZfcOu.fc - S u )(S V/k ~ S v ) 

VlfcOlU “ 5x) 2 Zk(sp,k - v) 2 


(5) 


where s uk is the position of p uk in the decreasing order ranking of u’s Big Five 
scores. Analogously to previous cases, the incorporation of this similarity into (2) 
produces a recommendation method that we call SPE, and which will be instantiated 
and evaluated in the SPE-c-per, SPE-d-per, SPE-c-A and SPE-d-A methods. 
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4 Experiments 

4.1 Dataset 

The dataset used in our experiments is part of the database made publicly available in 
my Personality project [1], my Personality is a Facebook application with which users 
take psychometric tests and receive feedback on their scores. The users allow the 
application to record personal information from their Facebook profiles, such a 
demographic and geo-location data, likes, status updates, and friendship relations, 
among others. In particular, as of March 2014, the tool has let record a database with 
46 million Facebook likes of 220,000 users for 5.5 million items of diverse nature - 
people (actors, musicians, politicians, sportsmen, writers, etc.), objects (movies, TV 
shows, songs, books, games, etc.), organizations, events, etc.-, and the Big Five 
scores of 3.1 million users, collected using 20 to 336 item IPIP questionnaires. 

Due to the size and complexity of the database, in this paper we restrict our study 
to a subset of its items. Specifically, we selected all likes (ratings) associated to items 
belonging to one of the following 3 categories: Movie genre, Book genre, and 
Musical genre. Thus, for instance, selected items belonging to the Movie genre 
category are movie genres such as comedy, action, adventure, drama, and science 
fiction. Note that we do not take into account a large number of potential valuable 
items, such as particular movies preferred by users. Next, we selected those users of 
the dataset that had ratings for the considered items. Once the items and users were 
selected, we conducted text processing operations to consolidate morphological 
derivations of certain item names (e.g. science fiction, science-fiction, sci-fi, and sf). 
Table 1 shows some statistics about the number of users, items and ratings in the 
considered domains. The minimum, maximum, and average (standard deviation) 
numbers of ratings per user were: 1, 19 and 1.73 (1.14) for movies, 1, 15 and 1.57 
(1.08) for books, and 1, 61 and 3.68 (3.66) for music. In the table, users are grouped by 
gender and by age according to Erikson’s psychosocial stages [7] (see Table 2). 


Table 1 . Statistics of the used dataset (gender and age of some users were not declared) 




all female 


16168 

15251 

17980 


9827 

9919 

10924 


Domain -1- 

(#items) aIl female 


(268) 

(305) 


(1175) 


27921 

(99.36%) 

23882 

(99.49%) 

66079 

(99.69%) 


17073 

(99.35%) 


15717 

(99.48%) 


(99.70%) 



6341 

5332 

7056 


2833 

2672 

3164 


3086 

2916 

3467 


Mikes 


(sparsity) 


108484850 
( 99 . 36 %) | ( 99 . 36 %) 


8165 4105 

(99.50%) | (99.50%) 


27181 11628 

(99.67%) | (99.69%) 


(99.37%) 

4478 

(99.50%) 

12055 

(99.70%) 


middle-aged advanced-aged 

adult adult 


1610 


1577 202 


1898 234 


middle-aged 

adult 


advanced-aged 

adult 


( 99 . 39 %) ( 99 . 43 % ) 

2493 336 

(99.48%) (99.45%) 

5876 654 

( 99 . 74 %) ( 99 . 76 %) 
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As explained in Section 3, we propose to evaluate the recommendation methods 
utilizing both continuous and discrete user personality profiles. Specifically, for 
building the discrete profiles, we transform each original personality factor score in 
[1, 5] into one of the following categories: “low”, “medium”, and “high.” Considering 
the distribution of personality factor scores of all users in the dataset, for each 
personality factor, the low category is assigned to those scores that are below the 33 th 
percentile, the medium category is assigned to those scores that are between the 33 th 
and 66 th percentiles, and the high category is assigned to those scores that are between 
the 66 th and 100 th percentiles. Table 2 shows the obtained intervals. 


Table 2. Considered categories for user ages (left) and personality factors (right) 


Age category 

Age interval 

Personality 

Personality factor categories and intervals 

adolescent 

< 20 years old 

factor 

low 

medium 

high 

young adult 

[20, 39] 

OPE 

[1.00,3.75) 

[3.75,4.25) 

[4.25,5.00] 

middle-aged adult 

[40, 50] 

COS 

[1.00,3.05) 

[3.05, 3.75) 

[3.75,5.00] 

advanced-aged adult 

> 50 years old 

EXT 

[1.00,3.25) 

[3.25,4.00) 

[4.00,5.00] 



AGR 

[1.00,3.25) 

[3.25,4.00) 

[4.00,5.00] 



NEU 

[1.00, 2.50) 

[2.50,3.10) 

[3.10,5.00] 


4.2 Results 

In our experiments we empirically compared 61 methods: CF, COS-c-A, COS-d-A, 
PEA-c-A, PEA-d-A, SPE-c-A, and SPE-d-A, with A = 0,0.1,0.2, ...,0.9. The methods 
were evaluated in the three considered domains, by means of their precision, recall 
and F-measure values for the top k recommendations -P @k, R @k and F @k- with k = 
1,...,5, averaged by 5-times 5-fold cross validation. All methods were executed with 
neighborhood sizes of 5, 10, 15, 20 and 25. Tables 3, 4 and 5 show the methods’ 
configurations with best results (in terms of average F-measure values). All 
differences with CF’s values are statistically significant (2-tailed Wilcoxon, p<0.05). 


Table 3. Avg. P@k, R @k and F @k values achieved by the best performing methods in the 
movies domain. Values higher than CF’s are in hold, and the highest values are underlined. 


Method 

CF _ 


COS-d-pers 


PEA-c-0.2 


PEA-d-0.1 

SPE-c-pers 

SPE-c-0.1 

SPE-d-pers 


| Neighbors | 

20 


10 


10 


to 


P@1 P@2 P@3 P@4 P@5 

.146 .382 .350 ,480 .490 

.099 .242 .290 .339 .353 

■154 .400 .488 .463 .460 

.068 .228 .282 .252 .298 

■149 .405 .433 .456 .532 

.095 .267 .337 .375 .373 

■178 .442 .466 .439 .372 

.061 .210 .302 .280 .321 

.149 .400 .436 .476 .540 

.102 .243 .351 .357 .261 

■162 .380 .429 .463 .300 

.057 .226 .299 .296 .330 

.139 .406 .455 .463 .410 


R@1 R@2 R@3 R@4 R@5 

.025 .140 .247 .268 .298 

.099 .241 .284 .292 .294 

.024 .119 .242 .306 .291 

.068 .228 .282 .252 .298 

■062 .213 .306 .276 .313 

.095 .267 .333 .343 .323 

.071 .236 .350 .340 .314 

.061 .209 .294 .267 .315 

.075 .221 .311 .284 317 

.102 .240 .331 .316 .223 

■089 .257 .322 .347 .227 

.050 .200 .266 .237 .254 

.047 .188 .297 .288 .261 


F@1 F@2 F@3 F@4 F@5 

.043 .205 .290 .344 .371 

.099 .241 .287 .314 .321 

.042 .183 .324 .368 .356 

.068 .228 .282 .252 .298 

■088 .279 .359 .345 .394 

.095 .267 .335 .358 .346 

.102 .308 J00 .383 .341 

.061 .209 .298 .273 .318 

.100 .285 .363 .356 399 

.102 .241 .341 .335 .241 

■115 .307 .368 .397 .258 

.053 .212 .282 .263 .287 

.070 .257 .359 .355 .319 
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Table 4. Avg. P@k, R @k and F @k values achieved by the best performing methods in the 
books domain. Values higher than CF’s are in bold, and the highest values are underlined. 


Method 

Neighbors 

P@1 

P@2 

P@3 

P@4 

P@5 

R@1 

R@2 

R@3 

R@4 

R@5 

F@1 

F@2 

F@3 

F@4 

F@5 

CF 

15 

.097 

.146 

.189 

.227 

.291 

.026 

.081 

.129 

.199 

.245 

.041 .104 .153 .212 .266 

COS-c-pers 

5 

.052 

.102 

.152 

.201 

.243 

.052 

.102 

.150 

.187 

.196 

.052 

.102 

.151 

.194 

.217 

COS-c-O.l 

15 

.095 

.150 

.235 

.292 

.270 

.027 

.081 

.147 

.254 

.219 

.042 

.105 

.181 

.272 

.242 

COS-d-pers 

10 

.035 

.098 

.163 

.236 

.286 

.035 

.098 

.163 

■236 

.286 

.035 

.098 

.163 

.236 

.286 

COS-d-O.3 

10 

.087 

.153 

■236 

.266 

.342 

.029 

.085 

.164 

.205 

.276 

.044 

.109 

.194 

.232 

.305 

PEA-c-pers 

10 

.050 

.088 

.130 

.208 

.266 

.050 

.088 

.130 

.208 

.266 

.050 

.088 

.130 

.208 

.266 

PEA-c-0.2 

10 

.087 

.159 

.196 

.263 

.277 

.041 

.103 

.150 

.216 

.242 

■056 

.125 

.170 

.237 

.258 

PEA-d-pers 

10 

.043 

.100 

.164 

.231 

.256 

.043 

.100 

.164 

.231 

.256 

.043 

.100 

.164 

.231 

.256 

PEA-d-0.5 

10 

.077 

■164 

.221 

.248 

.381 

.023 

.090 

.144 

.209 

■298 

.035 

.116 

.174 

.227 

■334 

SPE-c-pers 

5 

.040 

.105 

.170 

.164 

.218 

.040 

.104 

.169 

.158 

.197 

.040 

.104 

.169 

.161 

.207 

SPE-c-0.5 

20 

.092 

.156 

.184 

.229 

.272 

.036 

.112 

.164 

.221 

.272 

.052 

.130 

.173 

.225 

.272 

SPE-d-pers 

10 

.036 

.083 

.141 

.242 

.272 

.032 

.076 

.129 

.218 

.255 

.034 

.079 

.135 

.229 

.263 

SPE-d-0.3 

10 

.086 

.157 

.222 

.249 

.355 

.031 

.087 

.152 

.173 

.260 

.046 

.112 

.180 

.204 

.300 

Table 5. 

Avg. P@k, R @k and F@k values 

achieved 

iy the best performing methods ir 

the 

music domain. Values higher 

than 

CF’s 

arei 

bold, anc 

the highest values 

re underlined. 


Method 

Neighbors 

P@1 

P@2 

P@3 

P@4 

P@5 

R@1 

R@2 

R@3 

R@4 

R@5 

F@1 

F@2 

F@3 

F@4 

F@5 

CF 

5 

.130 

.192 

.230 

.250 

.247 

.066 

.118 

.148 

.169 

.172 

.088 

.146 

.180 

.202 

.203 

COS-c-pers 

5 

.033 

.066 

.083 

.094 

.107 

.033 

.065 

.083 

.094 

.106 

.033 

.066 

.083 

.094 

.106 

COS-c-O.8 

5 

■135 

■197 

■238 

.248 

.247 

.069 

■120 

■151 

.167 

.172 

■092 

■149 

■184 

.200 

.203 

COS-d-pers 

5 

.067 

.073 

.081 

.080 

.089 

.067 

.073 

.081 

.080 

.088 

.067 

.073 

.081 

.080 

.088 

COS-d-O.7 

5 

.130 

.189 

.219 

.222 

.236 

■070 

.119 

.149 

.164 

.182 

.091 

.146 

.177 

.189 

.206 

PEA-c-pers 

5 

.032 

.058 

.073 

.090 

.099 

.032 

.058 

.073 

.090 

.098 

.032 

.058 

.073 

.090 

.098 

PEA-c-0.9 

5 

.131 

.194 

.234 

.244 

.255 

.067 

.115 

.149 

.167 

.184 

.089 

.144 

.182 

.198 

■214 

PEA-d-pers 

5 

.064 

.071 

.081 

.079 

.091 

.064 

.071 

.081 

.078 

.088 

.064 

.071 

.081 

.078 

.089 

PEA-d-0.9 

5 

.134 

.194 

.228 

.250 

.248 

.069 

.119 

.144 

.169 

.177 

.091 

.148 

.177 

.202 

.206 

SPE-c-pers 

5 

.024 

.060 

.081 

.096 

.101 

.024 

.059 

.081 

.095 

.099 

.024 

.059 

.081 

.095 

.100 

SPE-c-0.9 

5 

.130 

.193 

.227 

.241 

.241 

.067 

.115 

.145 

.163 

.177 

.088 

.144 

.177 

.194 

.204 

SPE-d-pers 

5 

.070 

.074 

.073 

.073 

.089 

.063 

.067 

.066 

.067 

.079 

.066 

.070 

.069 

.070 

.084 

SPE-d-0.9 

5 

.133 

.196 

.224 

.245 

.240 

.063 

.108 

.133 

.155 

.162 

.086 

.139 

.167 

.190 

.193 


In general, for the three domains, the proposed hybrid recommendation methods 
that exploit information about the users’ personality outperform the baseline CF 
method. It has to be noted that, probably due to the scarcity of user preferences (see 
Table 1), those methods that give more weight to user personality (i.e., those methods 
with A values close to 0) obtain the highest precision and recall values. This is not the 
case in the music domain, where the average number of rated items per user is larger 
than in the movies and books domains (see Section 4.1), and the relationships between 
user preferences and personality are the weakest ones (as will be shown in Section 5). 

In terms of average F-measure values, for most cases in the movies and books 
domains, the discrete profiles let obtained better results than the continuous profiles. 
Regarding the user similarities, there is no consensus on which is the best alternative 
for the studied domains. While in the music domain the COS methods are the best 
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performing, in the movies and books domains the PEA and SPE methods respectively 
achieve the highest precision and recall values. 

4.3 Discussion 

The obtained results show that the proposed 3-category personality profiles let 
generate better recommendations than those obtained with numeric personality 
profiles, which are the ones used previously in the literature. The discretization 
process of the Big Five presented in this paper follows a simple approach based on the 
personality factor distributions in the whole dataset. Approaches dependent on the 
target domain, considering a different number of personality factor categories, or 
based on certain user aspects may help improve generated recommendations. 

The nature of the utilized dataset, which comprises like preferences for domain 
genres, entailed the use of the Jaccard user similarity (formula 1) in the heuristic- 
based collaborative filtering strategy for the top N recommendation task. If either 
binary like/unlike records or numeric ratings were available as user preferences, more 
elaborated collaborative filtering methods -e.g. those based on the Pearson’s and 
Spearman’s correlation coefficients- could be investigated for both the top N and 
rating prediction recommendation tasks. 

Moreover, although precision and recall metrics are appropriate and widely used for 
evaluating the top N recommendation task, other metrics could be explored when 
assessing personality-aware collaborative filtering methods. In particular, measuring 
recommendation diversity may be of special interest. As shown in previous studies [2, 
14], people open to experience tend to have preferences for more diverse types of items. 

5 Relationships between User Preferences and Personality 

As done in previous work (see Section 2.2), here we report relationships existing 
between user preferences and personality. Tables 6 and 7 show linear correlation 
values between preference- and personality-based user similarities for all distinct 
pairs of users in each domain. We report such correlation values for different groups 
of users based on their gender and age, as explained in Section 4.1. All values marked 
with asterisk (*) are statistically significant at p<0.05. 

When considering all users in a domain, it can be seen that correlation values are 
close to 0, meaning that there is no (linear) relations between user preferences and 
personality factors. If we consider age-based groups, the relations are strong for 
adolescent users in the movies domain, and are moderate for middle-aged adults in 
the movies and music domains. On the other hand, if we consider gender-based 
groups, we can observe that the correlation values are higher between female users in 
the movies domain and, in less degree, in the books domain. In the music domain, 
there is no clear distinction for correlations between male and female users. Finally, 
we note that there are no clear patterns between the correlations obtained with the 
cosine- and Pearson/Spearman-based personality profiles. 
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Table 6. Linear correlations between preference- and personality-based user similarities 



Table 7. Linear correlations between preference- and personality-based user similarities 



6 Conclusions and Future Work 

In this paper we have presented an empirical study comparing several collaborative 
filtering methods that effectively exploit information about the users’ personality. In 
addition to evaluating state of the art approaches, we have assessed new personality- 
aware recommendation strategies and user profiles, and have used large datasets in 
several domains. Moreover, we have analyzed relationships between user preferences 
and personality considering the users’ gender and age, which has revealed differences 
in the preference-personality patterns, calling for further work in the future. 
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In our study, however, we have focused on broad representations of the users’ 
personality by assuming the same relevance for all the Big Five personality factors. 
As previous work has shown [3, 14, 16], personality factors may influence differently 
on user preferences for certain types of items. In the future we shall investigate how 
fine grained relationships between user preferences and personality factors can be 
exploited in recommendation. Moreover, we also envision the consideration of 
personality facets (in addition to or instead of personality factors) as a potential way 
to exploit personality information in collaborative filtering. 

Our investigation has revealed there are differences in the relationships between user 
preferences and personality according to the users’ gender and age. Nonetheless, we 
have not exploited such particularities for recommendation purposes yet. In this context, 
other user attributes, such as educational attainment, could be taken into account as well. 

Finally, we also plan to evaluate new personality-aware collaborative filtering 
approaches, e.g. methods based on factor models, and strategies focused on neighbor 
selection and rating matrix enrichment, instead of on user similarities. 
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